Delving Deeper into Anti-Aliasing in ConvNets
نویسندگان
چکیده
Aliasing refers to the phenomenon that high frequency signals degenerate into completely different ones after sampling. It arises as a problem in context of deep learning downsampling layers are widely adopted architectures reduce parameters and computation. The standard solution is apply low-pass filter (e.g., Gaussian blur) before (Zhang in: ICML, 2020). However, it can be suboptimal same across entire content, feature maps vary both spatial locations channels. To tackle this, we propose an adaptive content-aware filtering layer, which predicts separate weights for each location channel group input maps. We investigate effectiveness generalization proposed method multiple tasks, including image classification, semantic segmentation, instance video image-to-image translation. Both qualitative quantitative results demonstrate our approach effectively adapts frequencies avoid aliasing while preserving useful information recognition. Code available at https://maureenzou.github.io/ddac/ .
منابع مشابه
Delving deeper into MALT lymphoma biology.
Mucosa-associated lymphoid tissue (MALT) lymphomas can arise in a variety of extranodal sites. Interestingly, at least 3 different, apparently site-specific, chromosomal translocations, all affecting the NF-kappaB pathway, have been implicated in the development and progression of MALT lymphoma. The most common is the translocation t(11;18)(q21;q21), which results in a fusion of the cIAP2 regio...
متن کاملDelving Deeper into MOOC Student Dropout Prediction
In order to obtain reliable accuracy estimates for automatic MOOC dropout predictors, it is important to train and test them in a manner consistent with how they will be used in practice. Yet most prior research on MOOC dropout prediction has measured test accuracy on the same course used for training the classifier, which can lead to overly optimistic accuracy estimates. In order to understand...
متن کاملDelving Deeper into Convolutional Networks for Learning Video Representations
We propose an approach to learn spatio-temporal features in videos from intermediate visual representations we call “percepts” using Gated-Recurrent-Unit Recurrent Networks (GRUs). Our method relies on percepts that are extracted from all levels of a deep convolutional network trained on the large ImageNet dataset. While high-level percepts contain highly discriminative information, they tend t...
متن کاملDelving Deeper into Convolution Networks for Learning Video Representation
Video analysis and understanding represents a major challenge for computer vision and machine learning research. While previous work has traditionally relied on hand-crafted and task-specific representations, there is a growing interest in designing general video representations that could help solve tasks in video understanding such as human action recognition, video retrieval or video caption...
متن کاملDelving deeper into the whorl of flower segmentation
We describe an algorithm for automatically segmenting flowers in colour photographs. This is a challenging problem because of the sheer variety of flower classes, the variability within a class and within a particular flower, and the variability of the imaging conditions – lighting, pose, foreshortening, etc. The method couples two models – a colour model for foreground and background, and a li...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Vision
سال: 2022
ISSN: ['0920-5691', '1573-1405']
DOI: https://doi.org/10.1007/s11263-022-01672-y